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Abstract 



We consider a multivariate version of the so-called Lancaster problem of character- 
izing canonical correlation coefficients of symmetric bivariate distributions with identical 
\^ I marginals and orthogonal polynomial expansions. The marginal distributions examined in 

fSJ . this paper are the Dirichlet and the Dirichlet-Multinomial distribution, respectively on the 

continuous and the TV-discrete d-dimensional simplex. Their infinite-dimensional limit dis- 
tributions, respectively the Poisson-Dirichlet distribution and the Ewens' sampling formula, 
are considered as well. We study in particular the possibility of mapping canonical corre- 
lations on the c?-dimensional continuous simplex (i) to canonical correlation sequences on 
the d + 1-dimensional simplex and/or (ii) to canonical correlations on the discrete simplex, 
. and viceversa. Driven by this motivation, the first half of the paper is devoted to providing 

a full characterization and probabilistic interpretation of |n|-orthogonal polynomial kernels 
(i.e. sums of products of orthogonal polynomials of the same degree |n|) with respect to the 
mentioned marginal distributions. Orthogonal polynomial kernels are important to over- 
^ . come some non-uniqueness difficulties arising when dealing with multivariate orthogonal (or 

bi-orthogonal) polynomials. We establish several identities and some integral representa- 
. tions which are multivariate extensions of important results known for the case d = 2 since 

the 1970's. These results, along with a common interpretation of the mentioned kernels in 
terms of dependent Polya urns, are shown to be key features leading to several non-trivial 
' solutions to Lancaster's problem, many of which can be extended naturally to the limit as 

Q; d-^oo. 



1 Introduction. 



^ • Let TT be a probability measure on some Borel space {E,S) with C M. Consider an ex- 

changeable pair {X, Y) of random variables with given marginal law vr. Modeling tractable 
joint distributions for (X, Y) with vr as given marginals is a classical problem in Mathematical 
Statistics. One possible approach, introduced by Oliver Lancaster [29] is in terms of so-called 
canonical correlations. Let {Pn}^=o be a family of orthogonal polynomials with weight measure 
TT i.e. such that 

(P„(X)P„(X)) = —5nm, n,m G Z+ 

Cm 

for a sequence of positive constants {cm}- Here 6mn = 1 if n = m and otherwise, and 
denotes the expectation taken with respect to n. 

A sequence p = {pn} is the sequence of canonical correlation coefficients for the pair {X, Y) if 
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it is possible to write the joint law of {X, Y) as 



gp{dx,dy) = 'n{dx)-n{dy) < 



PnCnPn{x)Pn{y) \ • (l-l) 



Suppose that the system {Pn\ is complete with respect to L2(vr), that is, every function / with 
finite vr- variance admits a representation 

oo 

f{x) = Y,J{^ynPn{x) (1.2) 
n=0 

where 

/(n) = , n = 0,1,2,... (1.3) 

Define the regression operator by 

Tf{x) :=E(/(y)|X = x). 
If (X, y) have canonical correlations {pn} then, for every / with finite variance, 

oo 

Ppfi^) = YPnf{n)CnPn{x). 



n=0 



In other words. 
In particular, 
and 



Tpf{n)=pnf{n), n = 0,1,2,.... 

TpPfi — PnPm ^ — 0, 1, . . . 



TpPm{n) = 6mnPmPm, m, n = 0, 1, . . . 

which means that the polynomials {Pn} are the eigenfunctions and p is the sequence of eigenval- 
ues of T. Lancaster's problem is therefore a spectral problem whereby regression operators with 
given eigenfunctions are uniquely characterized by their eigenvalues. Because Tp maps positive 
functions to positive functions, the problem of identifying canonical correlation sequences p is 
strictly related to the problem of characterizing so-called positive- definite sequences. 



In this paper we consider a multivariate version of Lancaster's problem, when vr is taken to be 
the either the Dirichlet or the Dirichlet-Multinomial distribution (notation: and DMa^N, 
with a G M^|_ and N € Z-|_) on the {d — l)-dimensional continuous and A^-discrete simplex, 
respectively: 

A(rf_i) := {x G [0, 1]^ : \x\ = 1} 

and 

iVA(rf_i) := {m e : |m| = N}. 

The eigenfunctions will be therefore represented by multivariate Jacobi or Hahn polynomials, 
respectively. One difficulty arising when d > 2 is that the orthogonal polynomials Pn = Pnin2...ni 
are multi-indexed. The degree of every polynomial Pn is |n| := ni + ■ + Ud- For every integer 
|n| there are 

\n\+d- V 
d-1 
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polynomials with degree \n\, so when d > 2 there is not an unique way to introduce a total 
order in the space of all polynomials. One way to overcome such a difficulty is by working with 
orthogonal polynomial kernels. 

By |n|-orthogonal polynomial kernels with respect to vr we mean functions of the form 

P\n\{x,y)= Yl CmPm{x)Pm{y), |n| = 0, 1, 2, . . . (1.4) 

mGZ5j.:|m.| = |n 

where \m\ = mi + • • • + for every m S lA^. Polynomial kernels are uniquely defined and 
totally ordered. 

A representation equivalent to (1.2) in term of polynomial kernels is: 

oo 
|n|=0 

If / is a polynomial of order \m\ the series terminates at |77t,| . Consequently, for general d> 2, the 
individual orthogonal polynomials Pn{x) are uniquely determined by their leading coefficients 
of degree \n\ and i-]„|(x,y). If a leading term is 

d 

{A:;|fc| = |n|} 1 



then 



Pn{x)= Y bnkE[flyt'P\n\{x,Y)], (1.6) 

{fc:|A;| = |n|} 1 

where Y has distribution vr. 
It is easy to check that 

E,[P|„i(x,y)PH(z,y)] =P\ 

P^n\{x,y) also has an expansion in terms of any complete sets of biorthogonal polynomials of 
degree |n|. That is, if {P^(x)} and {P°(a;)} are polynomials orthogonal to polynomials of 
degree less that \n\ and 

E 

then 



p:{x)p:,{x)] =5„ 



P\n\{x,y)= Yl Pn{x)P°{y)- (1-7) 
{n:|7i| fixed} 



Biorthogonal polynomials always have expansions 



Pn{x) — ^ ^ '^nmPmix) 

{m:\m\ = \n\} 

Pn{x) = Y^ (^nmPmix), (1.^ 

{m:\m\ = \n\} 



where the matrices of coefficients satisfy 



^oT^o ^ equivalent to C° = C^"^^. 
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Similar expressions to (1.6) hold for P^ix) and Pnix) using their respective leading coefficients. 
This can be shown by using their expansions in an orthonormal polynomial set and applying 
(1.6). 

The polynomial kernels with respect to and DMa,N will be denoted by Qj^|(a;,y) and 
-ff|"l(r, s), and called Jacobi and Hahn kernels, respectively. 



This paper is divided in two parts. The goal of the first part is to describe Jacobi and Hahn 
kernels under a unified view: we will first provide a probabilistic description of their structure 
and mutual relationship, then we will investigate their symmetrized and infinite-dimensional 
versions. 

In the second part of the paper we will turn our attention to the problem of identifying canonical 
correlation sequences with respect to Da and DMa- We will restrict our focus on sequences p 
such that, for every ?i G 

Pn = P\n\- 

For these sequences, Jacobi or Hahn polynomial kernels will be used to find out conditions for 
a sequence to satisfy the inequality 

oo 

P\m\P\m\{u,v)>0. (1.9) 

|n|=0 



Since Tp is required to map constant functions to constant functions, a straightforward necessary 
condition is always that 

Po = 1- 

For every d = 2,3, . . . and every a G M^^ we will call any solution to (1.9) an a-Jacobi positive 
definite sequence (a-JPDS) if vr = and an a-Hahn postive- definite sequence (a-HPDS) if 
vr = DMo,. 

We are interested, in particular, in studying if and when one or both the following statements 
are true. 

(PI) For every d and a. G p is a-JPDS <^ p is a- JPDS for every 5 G ]R++^ : |5| = |a|; 
(P2) For every (i and a G M+ p is a-JPDS <^ p is a-HPDS. 

Regarding (PI), it will be clear in Section 7 that the sufficiency part (<^) always holds. To 
find conditions for the necessity part (=^>) of (PI), we will use two alternative approaches. The 
first one is based on a multivariate extension of a powerful product formula for the Jacobi 
polynomials, due to Koornwinder and finalized by Gasper in the early 1970's: for a, /3 in a 
"certain region" (see Theorem 3.51 further on), the integral representation 

holds for a probability measure nix^y on [0,1]. Our extension for multivariate polynomial kernels, 
of non-easy derivation, is found in Proposition 5.4 to be 

Qf„|(x,y)=E[Q;:;|'l"l-"''(Zrf,l)] , |n|=0,l,... (1.10) 

for every d and a G in a "certain region," and for a particular [0, l]-valued random variable 
Zd- Here, for every j = \, . . . ,d, ej = (0, 0, . . . , 1, 0, . . . , 0) G is the vector with all zero 
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components except for the j-th coordinate which is equal to 1. Integral representations such 
as (1.10) are useful in that they map immediately univariate positive functions to the type of 
bivariate distribution we are looking for: 

f{x) > ^ E [f{Za)] = / WQ|n| {x, y) > 0. 

\n\ 

In fact, whenever (1.10) holds true, we will be able to conclude that (PI) is true. 
The identity (1.10) holds only with particular choices of the parameter a. At the best one 
needs one of the a^'s to be greater than 2. This makes it hard to use (PI) to build canonical 
correlations with respect to Poisson-Dirichlet limit marginals on the infinite simplex. The latter 
would be a desirable aspect for modeling dependent measures on the infinite symmetric group 
or for applications e.g. in Nonparametric Bayesian Statistics. 

On the other hand, there are several examples in the literature of positive-definite sequences 
satisfying (PI) for every choice of a, even in the limit case of |q;| = 0. Two notable and 
well-known instances are 



(i) 



p|„|(i) = e-il"l(l"l+l"l-i)*, |n| =0,1,... 

arising as the eigenvalues of the transition semigroup of the so-called d-type, neutral 
Wright-Fisher diffusion process in Population Genetics. 



(ii) 



P|„|(z) = >l, |n|=0,l. 



■'I 

i.e. the eigenvalues of the so-called Poisson kernel, whose positivity is a well-known result 
in Special Functions Theory (see e.g. [18], [7]). 

A probabilistic account of the relationship existing between examples (i) and (ii) is given in 
[12]. 

It is therefore natural to ask when (PI) holds with no constraints on the parameter a. 
Our second approach to Lancaster's problem will answer in part to this question. This approach 
is heavily based on the probabilistic interpretation of the Jacobi and Hahn polynomial kernels 
shown in the first part of the paper. We will prove in Proposition 8.1 that, if {d^^^ : m = 
0,1,2,...} is a probability mass function (pmf) on Z+, then every positive-definite sequence 
{p\n\]'^\=Q of the form 

|m|!r(|a| -|- |m|) 

71 , 1 ixiT^.i I , I 1 , I u Qlmh 7TT, =0, 1,... (1.11) 

( m — n )!i ( a -|- m -|- \n\) ' ' 

\m\ = \n\ 

satisfies (PI) for every choice of a, therefore (PI) can be used to model canonical correlations 
with respect to the Poisson-Dirichlet distribution. 

In Section 9 we investigate the possibility of a converse result, i.e. will find a set of conditions 
on a JPD sequence p to be of the form (1.11) for a pmf 

As for Hahn positive-definite sequences and (P2), our results will be mostly consequence of 
Proposition 3.1, where we establish the following representation of Hahn kernels as mixtures of 
Jacobi kernels: 



|iV|!r(|a| + \N 



\n\ =0,1,... 
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for every G Z+ and r, s G A^A(^„;^), where the expectation on the right-hand side is taken 
with respect to D^+r ® Da+s i-e. a product of posterior Dirichlet probability measures. A 
similar result was proven by [17] to hold for individual Hahn polynomials as well. 
We will also show (Proposition 6.1) that a discrete version of (1.10) (but with the appearance 
of an extra coefficient) holds for Hahn polynomial kernels. 

Based on these findings, we will be able to prove in Section 7.2 some results "close to"(P2): we 
will show that JPDSs can be viewed as a map from HPDSs and also the other way around, but 
such mappings are not in general the identity (i.e. (P2)). 

On the other way, we will show (Proposition 7.8) that every JPDS is indeed the limit of a 
sequence of (P2)-positive-definite sequences. 

Our final result on HPDSs is in Proposition 8.6, where we prove that if, for every N, a JPDS 
p is of the form 1.11, for a probability distribution = {d|^|^}|m|GZ+ such that d\^^ = for 
/ > N, then (P2) holds properly. Such sequences also satisfy (PI) and admit infinite-dimensional 
Poisson-Dirichlet (and Ewens' sampling distribution) limits. 

The key for the proof of Proposition 8.6 is provided by Proposition 3.5, where we show the 
connection between our representation of Hahn kernels and a kernel generalization of a product 
formula for Hahn polynomials, proved by Gasper [11] in 1973. Proposition 3.5 is, in our opinion, 
of some interest even independently of its application. 

1.1 Outline of the paper. 

The paper is organized as follows. Section 1.2 will conclude this Introduction by recalling some 
basic properties and definitions of the probability distribution we are going to deal with. In 
Section 2 an explicit description of Q? , is given in terms of mixtures of products of Multinomial 
probability distributions. We will next obtain (Section 3) an explicit representation for as 
posterior mixtures of Q? , . This is done by applying an analogous relationship which was proved 
in [17] to hold for individual orthogonal polynomials. In the same section we will generalize 
Gasper's product formula to an alternative representation of Hfi, and will describe the connec- 
tion coefficients in the two representations. In Sections 4-4.2 we will then show that similar 
structure and probabilistic descriptions also hold for kernels with respect to the ranked versions 
of Da and DM^, and to their infinite-dimensional limits, known as the Poisson-Dirichlet and 
the Ewens' sampling distribution, respectively. As an immediate application, symmetrized Ja- 
cobi kernels will be used in section 4.1 to characterize individual (i.e. not kernels) orthogonal 
polynomials with respect to the ranked Dirichlet distribution. This will conclude the first part. 
Sections 5-6 will be the bridge between the first and the second part of the paper. We will 
prove the identity (1.10) for the Jacobi product formula and its Hahn equivalent. We will point 
out the connection between (1.10) and another multivariate Jacobi product formula due to 
Koornwinder and Schwartz [27]. In section 7 we will use the results of Section 5 to characterize 
sequences obeying to (PI), with constraints on a, and will investigate the existence of sequences 
satisfying (P2). 

In section 8 we will find sufficient conditions for (PI) to hold with no constraints on the param- 
eters, when a JPDS can be expressed as a linear functional of a probability distribution on 
We will discuss the possibility of a converse mapping from JPDSs to probability mass functions 
in Section 9. 

Finally, in Section 8.2 we will make a similar use of probability mass functions to find sufficient 
conditions for a proper version of (P2). 
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1.2 Elements from Distribution Theory. 

We briefly list the main definitions and properties of the probability distributions that will be 
used in the paper. We also refer to [17] for further properties and related distributions. For 
a, n S denote 

d 

r(a) = nr(a,) 



and 



Also, we will use 



=1 



\n\ \ \n\\ 
n 



r(a+x) 

r(a) 

r(a+l) 
r(a+l-x-) ' 



whenever the ratios are well defined. Here 1:=(1,1,...,1). 



If X G Z+ then (a)(2,) = a(a-|- 1) • • • (a + x — 1) and [d)y^^^ = a{a — 1) • • • (a — rr-l- 1). will denote 
the expectation under the probability distribution fj,. The subscript will be omitted when there 
is no risk of confusion. 



Definition 1.1. 



(i) Dirichlet (a) distribution, a G 



pd . 



T(\a\)x°'~- 

D^idx) := I{x G A^a-i))dx. 

1 [a) ^ ' 



(a) Dirichlet- Multinomial (a,iV) distribution, a G M^^, G : 



DM^ir;N) 



E 



Da 



r 



r\\ l«j(r) 
r J (|a|)(7V)' 



r G A^A 



(d-i)- 



(1.12) 



Define the ranking function ip : M"' — )• as the function reordering the elements of any vector 
y G M"' in decreasing order. Denote its image by 

V'(y) = / = (y|,•••,yd)• 
The ranked continuous and discrete simplex will be denoted by A^_-|^ = ^p{Ad-l) and A^A^_^ = 
ip{NA(i-i), respectively. 
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Definition 1.2. The Ranked Dirichlet distribution with parameter a S W^, is 

where Sd is the group of all permutations on {1, ... ,d} and ax = (^^.(i), . . . , Xo-(rf)). 
Similarly 

DMii ■,N) ■.= DMU ;iV)oV-' 
defines the Ranked Dirichlet Multinomial distribution. 



Ranked symmetric Dirichlet and Dirichlet-Multinomial measures can be interpreted as distri- 
butions on random partitions. 

For every r G Z'^ let /3j = /3j(r) be the number of elements in r G equal to j, so 

Pji''") — ^(^) is number of strictly positive components of r and "^^jli if!^i{r) = |r|. 
For each x G ^{d-i) denote 

k 

[x,r]d:= 

{jl,...,jfc)C{l,...,d}j = l 

where the sum is over all dj^] subsequences of k distinct integers, and let [x, r] be its extension to 
X G Aoo- Take a collection (^i, . . . of independent, identically distributed random variables, 
with values in a space of d "colors" {d < oo), and assume that Xj is the common probability of 
any of being of color j. The function [x, r]^ can be interpreted as the probability distribution 
of any such sample realization giving rise to k{r) distinct values whose unordered frequencies 
count /3i(r) singletons, /32(r) doubletons and so on. 

There is a bijection between r^ = ip{r) and /3(r) = (/3i(r), . . . ,/3|j,|(r)), both maximal invariant 
functions with respect to Sd, both representing partitions of \r\ in k{r) parts. Note that [x,r]rf 
is invariant too, for every d < oo. It is well-known that, for every x G A^, 



that is, for every x, 



[x,r^]d 



.r-va>iA(r^)!' 

represents a probability distribution on the space of random partitions of \r\. 

For \a\ > 0, let D^a\,dj DM^a\,d denote the Dirichlet and Dirichlet-Multinomial distributions 

with symmetric parameter {\a\/d, . . . ,\a\/d). Then 

rl Um/d)% 



rl \a^^ 



, • := ESF^^^ir) (1.15) 

Definition 1.3. The limit distribution ESF^^^{r) in (1.15) is called the Ewens Sampling For- 
mula with parameter \a\. 
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Poisson-Dirichlet point process ([23]). 

Let Y°° = {Yi,Y2, . . .) be the sequence of points of a non-homogeneous point process with 
intensity measure 

N\a\{y) = \a\y~^e-y. 

The probabihty generating functional is 

^|,|(0=]Eh (expjy loge(y)A^H(rfy)}) =exp||a|^ iC{y) - l)y~' dy^ (1.16) 

for suitable functions ^ : M — )• [0,1]. Then \Y^\ is a Gamma(|a|) random variable and is 
independent of the sequence of ranked, normalized points 



^p{Y'^) 



Definition 1.4. The distribution of X^°°, is called the Poisson-Dirichlet distribution with pa- 
rameter \a\. 

Proposition 1.5. (i) The Poisson-Dirichlet (|a|) distribution on Aqo is the limit 

PD\^\ = hm Df. ^. 



(ii) The relationship between and DM^ is replicated by ESF, which arises as the (sym- 
metric) moment formula for the PD distribution: 

ESF\^\ (r; N) = Epz.,., | (^J) ^^^^ [x, r^] | , r G iVA^-. (1.17) 

Proof. If y = (Yi^ . . . ,Yfi) is a collection of d independent random variables with identical 
distribution Gamma(|a|/(i, 1), then their sum \y\ is a Gamma(|a|) random variable independent 
of y/|y|, which has distribution Da\,d- The probability generating functional of Y is ([13]) 

J-|„l(0 (1,18) 
which, by continuity of the ordering function ijj, implies that if X^'^ has distribution D^^^ ^, then 

This proves (i). For the proof of (ii) we refer to [13]. □ 
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2 Polynomial kernels in the Dirichlet distribution. 

2.1 Polynomial kernels for d >2. 

The aim of this section is to prove the foUowing 

Proposition 2.1. For every a G M^j. and every integer \n\, the \n\-th orthogonal polynomial 
kernel with respect to Da is given by 

\n\ 
|mj=0 

where 

ajl, , = +2|n| - 1)(-1)I"I-H ^^±H)^ (2.20) 
\n\\m\ VI I II J\ J \m\\{\n\ - \m\y. ^ ' 

form a lower-triangular, invertible system, and 

^u-^y) = E ('7')f^n(-^^^)'' (2-21) 

|,|=H V * / ill 1 

An inverse relationship is 

<;H(^.!/)-'+E (|„|'^|t|),,„,, Qr.|(^.i')- 

A first construction of the Kernel polynomials was given by [13]. We provide here a revised 
proof. 

Proof. Let {Q^} be a system of orthonormal polynomials with respect to Da (i.e. such E((5° ^) = 
1). We need to show that, for independent Dirichlet distributed vectors X,Y, if |n|, \k\ < \m\, 
then 

EU\'^^{X,Y)Q°^iX)QUY)) = 5nk ,, '"".'""f, • (2.24) 

^ ' ' ^ + l"i|){|n|) 



If this is true, an expansion is therefore 



|n|=l ^' " {n:|ri| nxed } 

l"^l I I 

Inverting the triangular matrix with (m, n)th element 



{\a\ + |m|)(|„|) 
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gives (2.19) from (2.23). The inverse matrix is triangular with {\m\, |n|)th element 

(|a| +2|n| - l)(-l)l"l I I,. I I I ' / , \n\>\m\ 

\m\\{\n\ — \m\)\ 

and the proof will be complete. 
Proof of (2.24). Write 

1 0:KI = H} ^ ^ ^ 1 ^M + m){\n\) 

Expressing the last product in (2.26) as 

d-l d-l d~l 

1 1 {fc:|fc|<|n|} 1 

for constants bnk, from the identity 



k*=0 

shows that 



(r + a*)(„.) = ( I. ) + «*)(n*-fc*)^[fc. 

i.*_n \ / 



e( n ^^"^er.i(^, I = (^^V]!^ n + ^oin (2.27) 



Thus if |n| < \k\ < \m\, 

"I^Aj . I Ai I — 1^1} 

IH[H] Qo^^Y) + R,{Y), (2.28) 



(|a| + |m|)(i„|) 
where 

d-l 

ttnk n 

{fc:lA;| = |n.|} 1 

are terms of leading degree \n\ in Q^{X) and i?j(y), j = 0, 1, 2 are polynomials of degree less 
than |n| in Y. Thus if |n| < |A;| < m, 

E{c\'mliX,Y)Ql{X)QUY)) = £;(Q°(y){^^MLQ°^j(y)+^,(y)|) 

(2.29) 



(|a| + |m|)(|„|) 

By symmetry (2.29) holds for all n, k such that |n|, \ k\ < m. □ 



2.2 Some properties of the kernel polynomials. 
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2.2.1 Particular cases. 

Qo = 1, 

,Q? = (|a| + l)(6-l) 

d 

= (|a| + l)f|«| y^^Xiyi/aj - l) 
1 

Q2 = ^(|«|+3)((|a|+2)6-2(|a| + l)6 + |a|) , 

where 

d 

^2 = |a|(|a| + l)(^'^ixiyif/ai{ai + 1) + 2'^XiXjyiyj/aiaj 
1 i<j 



2.2.2 The j-th coordinate kernel. 



A well-known property of Dirichlet measures is that, if y is a Dirichlet(a) vector in A(-^_i) 
then its j-th coordinate Yj has distribution D^.^^^^_^.. Such a property is reflected in the Jacobi 
polynomial kernels. For every d let ej be the vector in M'^ with every z-th coordinate equal 6., 
i,j = 1, . . . ,d. Then 



I], 



(l«l)(|m|) _ 
(«j)(|m|) ' 



|m| E Z+, y £ A(rf„i). 



In particular, 



r (e- e.) - MlHlx^ 



Therefore, for every d and a e (2.30) implies 

|n| 
|m|=0 



(«j)(|m|) 



Qrn|(y 



where 



Qn\x,l) 
Qn'^(l,l) 



-n, n + — 1 

/3 



1 — X 



n = 0,1,2,.. 



(2.30) 



(2.31) 



{yj), i = l,...,d, ye A(d_i).. (2.32) 



(2.33) 



are univariate Jacobi Polynomials (a > 0, /3 > 0) normalized by their value at 1 and 



1 



E 



In (2.33), pFq, p, q G N, denotes the Hypergeometric function (see [1] for basic properties). 
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Remark 2.2. For a,f3 £ M+, let9 = a + (3. It is known (e.g. [17], (3.25)) that 

' nK. . ' i"^"^ (2.34) 



On i/ie oi/ier hand, for every a = (ai, . . . , a^), 

Cl "'-"^ = QU-.e,) = X; «K|H^^- (2-35) 

|m|=0 ^ ^H\m\) 

Thus the identity 

(l"|-"i)(H) _ ^ j'l^lV iyn|-M (l«l + l"|-^)(H) (2.36) 



/ioZds /or every \a\ positive and < aj < \a\ . In the limit as aj — ?• |a|, this implies 

/I l\ 

^ (-l)l"l-H(|«| + H)(|„j„,)=0. (2.37) 

|m|=0 ^' 

2.2.3 Addition of variables in x. 

Let A be a (i' X d (d' < d) 0-1 matrix whose rows are orthogonal. A known property of 
the Dirichlet distribution is that, if X has distribution D^, then ylA has a Dau distribution. 
Similarly, with some easy computation 

E(Cp^|(A,y) I AX = ax) = ^f^^{AX,Ay). 

One has therefore the following 

Proposition 2.3. A representation for Polynomial kernels in Dacx is: 

QinAx,Ay) =E\QUX,y) \ AX = Ax] . (2.38) 



Example 2.4. For any a € 

Then, denoting a' = ai + ■ ■ 



¥^ and k < d, suppose AX = (Ai-|-- • ■+Xk, Afc+i-|-- 
+ Ofc and f3' = ak+i + • • • + a^, one has 



■+Xd)=X'. 



^'f'R^'f'{x')R^'f'{y') 

n\ \n\ ^ ' \n\ ' 



E 



Qt\{X,y)\X' 



X 



3 Kernel Polynomials on the Dirichlet-Multinomial distribu- 
tion. 

For the Dirichlet-Multinomial distribution, it is possible to derive an explicit formula for the 
kernel polynomials by considering that Hahn polynomials can be expressed as posterior mixtures 
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of Jacobi polynomials (cf. [17], 5.2). Let |Q°(rE)} be a orthonormal polynomial set on the 
Dirichlet, considered as functions of {xi, . . . jXd^i). Define 



Kir ;|r|) = / Ql{x)Da+ridx), 



(3.39) 



then {/i^} is a system of multivariate orthogonal polynomials with respect to DMa with constant 
of orthogonality 



'o,|r| 



K{R;\r\ 



(l«l + kl)(|n|)' 

Note also that if |r| — )• oo with ri/\r\ Xi, i = 1, . . . ,d, then 

lim /i°(r ; |r|) = Q°(x). 
\r\^oc 

Proposition 3.1. The Hahn Kernel polynomials with respect to DMa(- \\r\) are 



n|(^>s) 



(«+ kl)(|n|) 



Q\n\ {x, y)Da+ridx)Da+s (dy) 



for r = {ri, . . . ,rd), s = {si, . . . , Sd), \r\ = \s\ fixed, and |?7,| = 0, 1, . . . , |r| 
An explicit expression is 



|a| + rl)(|n|) ,H,ai 



'^\n\\m\^\m\ ^ 



■m=0 



where (aj^j|,^|) is as in (2.20) and 



|/| = |m| 



m\\ |Q|(|m|) Uii(^i+ri){k){ai + Si)i^i^) 



E 



£'M^+^(/;|m|)Z^Ma+,(/;|m| 
L>Afa(/; |m|) 



(3.40) 



(3.41) 



(3.42) 



(3.43) 
(3.44) 



Proof. The Kernel sum is by definition 

^W(^«) = ^'"'|^|'[^'^^'"'^ E ^nir ; |r|)/i°(s ; |r|) (3.45) 

^1""^^ {n;|n| fixed} 

and from (3.41), (3.42) follows. The form of is obtained by taking the expectation 

,^P^|(X, y), appearing in the representation (2.19) of Q^^]^ with respect to the product mea- 
sure Da+T-Da+s- n 

The first polynomial kernel is 

/ ^ (|«| + l)(|a| +r) / \a\ A {uj + ri){a, + Sj) _ A 
\a\ \{\a\ + |r|)2 ^ Ui )' 



14 



Projections on one coordinate. 



As in the Jacobi case, the connection with Hahn polynomials on {0, . . . , N} is given by marginal- 
ization on one coordinate. 

Proposition 3.2. For \r\ G N and (i G N, denote rj^i = ej\r\ G N'^, where ej = (0, . . . , 0, 1, 0, . . . , 0) 

with 1 only at the j-th coordinate. 
For every a G N*^, 



Hl'n\is,e,\r\) 



1 , o(aj ,\a\—aj) , i :s,o{aj,\a\—aj). 



where 



Jq| \n 
'\r\,\n\ 



r ; r 



\s\ = r 



E 



h;[i''\R;\r\ 



l^l'l"!' (I«I + H)(H) 
and are orthogonal polynomials with respect to -DAf^,^ (•; |r|). 

Proof. Because for every d and a G W!^ 

\n\ 



^1 

■|r|,|n| |m|=0 



for d = 2 and a, /? > with a + /3 = |a| 



|n| 



\m\=0 



Now rewrite £ as 



i:'Ma+,(;;|m|)7:)Ma+^(;;|m|) 
DMa{l; \m\) 



E 

|i| = |m| 



|i|=H 



Consider without loss of generality the case j = 1 . Since for every a 

DMa{l; \m\) = DM^^^\^\_^^{li; \m\)DMa2^,„^ai{h, ■■■Jd] \m\ - h] 
then, for Irl = Isl 



(3.46) 



(3.47) 



oi+si,|Ql-oi + |m|-si 



(/i; |m|)- 



r : r 



/i=0 



|m| 



Z)Ma/_l_5/(u; |m| — /i 



(M;M) 

,Z?A4+;(0:0) 



|«| = |m|— 



L>A4(0;0) 



^' \u\ = \m\-h 



h=0 
\m\ 



Y,DM^+s{h;\m\) 



li=0 



DM^,^l,{\r\;\r\) 
DMa{\r\;\r\) 



^1 



\m\ 



isi,\r\) 



(3.48) 
(3.49) 
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Then (3.46) follows immediately. 



□ 



3.1 Generalization of Gasper's product formula for Hahn polynomials. 

For d = 2 and a, j3 > the Hahn polynomials 



/iPf(r;iV)=3F2 



|n|, |n| + ^ — 1, — r 
a,-N 



1 



\n\ =0,1,...,N. 



(3.50) 



with 9 = a + (3, have constant of orthogonality 

N 



U 



N,n r=0 

The following product formula was found by Gasper [10]: 
Thus 



O W(n~i) ^ + 2n-l («)(„)• 



(-1)I"I(/3)(H) ^ (-l)'+^|n|[,+,](g + \n\ - l)(,^,)r;,p[,](iV - r\,^{N - s\k^ 

(3.51) 



AT l"l 



(-1)I"I-H(0)(I„I_,)(0 + |n| - 1)(|„|)(0 + 2|n| - 1) 



(3.52) 



|m|!(|n| - |m|)!(6l)(|„|) 



\n\ 



{0 + N)^\n\) 



m. =0 



where 



E 

i=o 



™a,/3(j;|"^l) 



('T')n.](^-0[iH-.] 



Or"!] 



f|m| 



'')^b](^-s)[H-j] 



(3.53) 



By uniqueness of polynomial kernels, we can identify the connection coefficients between the 
functions and x- 



Proposition 3.3. For every \m\,\n\ G Z+, and every r, s E {0, . . . , N}, 

\m\ 



m=o 



where 



Proof. From 3.42, 



VII 



E 

|n| = |i| 



+ ^^)(|n|)y (^ + "^)(|n|) 



(3.54) 



(3.55) 



|n| 



-^>rn| ^)^Sl ^) = ^|n| ^) = ^'"'^ E <|IH^H •'(^' (3.56) 

i V r P 



|m|=0 
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Since the array A = has inverse C = A ^ with entries 



HH- V(^ + m)(|„|) 
then equating (3.56) and (3.52) leads to 

AT, 



m\ 



|5^/HH(0 + Ar)(|„l)-H 



|n|=0 



m=o 



(3.57) 



The following Corollary is then straightforward. 
Corollary 3.4. 



E 



E 



.H,a,l3 H,a,l3 



|m|A|«| 

E 

|n|=0 



b"l[izi]KI[h 



(^ + I"^I)(H)(^ + |/|)(H)' 



□ 



For every r G A^A(^_i) and m G Z^J. define 



i=l 

Gasper's product formula (3.51), or rather the representation (3.52) has a multivariate extension 
in the following. 

Proposition 3.5. For every d, a € and N G the Hahn polynomial kernels admit the 
following representation: 



where 



Nu 



\n\ 



E\a\ H,a/ N 

\n\\m\'^\m\ ^ ' ' 



(|a| +^)(|n|) 



r,sG iVA(rf_i), |n| =0,1,... (3.58) 



|m|=0 



1 



l:\l\ = \m 

Proof. If we prove that, for every \m\ and |n 



I 

|n|=0 "^lAfllnl 



(3.59) 
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where cj"j^.| are given by (3.57) (independent of dl), then the proof fohows by inversion. 
Consider the orthonormal multivariate Jacobi polynomials Q'^{x). The functions 



Kir; N) :-- 



Qn{x)Da+r{dx) 



■(d-l) 



satisfy the identity 

E 



/i°(i?;iV){y )M(i?) 



([12], (5.71)). 

Then for every fixed s, 



Nu,mK{1 ■,\m\)DM^{l;\m\), 



(3.60) 



E 



(3.61) 



so iterating the argument we can write 



E 



\m\\n\ 



X\'^'^{R,S)K{R;N)K{S;N) 
Now by uniqueness of the polynomial kernel, 

oo ^ 

^h(^>^) = E -l^^nir ■,N)h°^{s ■,N) 



(3.62) 



|n|=0 "^Nm 



therefore 



and the proof is complete. 



|m| 



I 

|n.|=0 "^lAfllnl 



□ 



The connection coefficients between ^i^d ^|^| are, for every d, the same as for the two- 
dimensional case: 

Corollary 3.6. For every d and a G M^|_, 



\l\=0 

where (61^1 m) are given by (3.55). 



(3.63) 



E 



\.H,a H,a 
^\m.\ X.\l\ 



E 



tH,a H,a 
^\l\ ^|m| 



tm|A|«| 

E 

|n|=0 



{\a\ + |m|)(|„|)(|a| + \l\)(\n\) 



|m|,|/| =0,1,2,.., 
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3.2 Polynomial kernels on the Hypergeometric distribution. 



Note that there is a direct alternative proof of orthogonaHty of H^^{r,s) similar to that for 
In the Hahn analogous proof orthogonality does not depend on the fact that \a\ > 0. 
In particular we obtain Kernels on the hypergeometric distribution: 




(3.64) 



by replacing a by — c in (3.42) and (3.43). Again a direct proof similar to that for (5|.„|(x) would 
be possible. 



4 Symmetric kernels on ranked Dirichlet and Poisson-Dirichlet 
measures. 



Let D^a^ ii be the Dirichlet distribution on d points with symmetric parameters (|a|/(i, . . . , |a|/(i), 
and d} I , its rankf 

\a\,d 

Proposition 4.1. 



and ^ its ranked version. Denote with QjJ^"'''^^ and Qjl^"'''^''"^ the corresponding |n|-kernels. 



where summation is over all permutations a of 1^ ... ^d. The Kernel polynomials have a similar 
form to Qi'T''^^ with ^m*^''^^ replaced by 

\n\ 



DMla\/l;\m\) 



where 



m ■■- 
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Proof. Note that 



7^12 12 °H'|m,|^H'''^Va;,y) 



crGGd |m|<|ri| 



|m|<|n| 



o-eGd \l\ = \m\ 



DM\^\^a{l;\m\) 



'7') jcTTxyiyy 



^|n||»n| (^\\2 12 12 
|m|<|n| ^ a,Tegd\l\ = \m\ 



'7') i'^^YiryY 



|m|<|n| ^ o-.TeSd |/| = |m| 



(4.67) 
(4.68) 



Now, 



Q 



(l"l''^)^(x,y)Q[H''^H(,,y) 



1 



.i\a\,d)i 



(4.69) 



hence q['?'''^'*"^ is the \n\ polynomial kernel with respect to uj, , ... The second part of the 



theorem, involving the identity (4.66), is just another way of rewriting (4.67). 
Remark 4.2. The first polynomial is Q^^"^''^'^^ = 0. 

4.0.1 Infinite-dimensional limit. 



□ 



A\a\,oo)l 



\a 



\m\\ iUT hi) [x;l][y; I] 
a|^'[0!l!]^i ••• [(/fc- l)!A;!]^fc 

ESF\^\{1) 



(4.70) 
(4.71) 



Proposition 4.3. T/je \n\-polynomial kernel with respect to the Poisson-Dirichlet point process 
is given by 

\n\ 



Q 



{\a\,oo)l 
\n\ 



E 

|m|=0 



^\a\ ^{\a\,oo)i 
\n\\m\^\m\ 



(4.72) 



The first polynomial is zero and the second polynomial is 

= {Fl-^^){F2-^i)/a\ 
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where 

oo oo 

Z7. _ \ ^ ^2 Z7 _ \ ^ „, 



1 1 

and 



2_ 2|a| 



l + lal' (|a| +3)(|a| +2)(|a| + 1)2' 



4.1 Symmetric polynomials. 



We can use the symmetric kernel derived in Section 4 to obtain a system of orthogonal polyno- 
mials with weight measure given by the 
property of general polynomial kernels. 



mials with weight measure given by the distribution ^. We first need to observe a uniqueness 



Lemma 4.4. Let vr he any measure on , and assume that P = {P„ : n G N*^} is a system of 
polynomials such that 

P\n\{x,y)= Yl Pm{x)Pm{v), |n| = 0, 1, 2, . . . 
{|m| = |n|} 

forms a complete orthogonal kernel system with respect to tt. 
Then ¥ is a system of orthonormal polynomials with respect to tt. 



Proof Because P^n^{x,y) is an orthogonal kernel, we have 

E[P\k\ix,Y)Piii{z,Y)] =6\kiiiiPiii{x,z), |A;|,|/| GN. 

This can be written as 

Pm{x)E[Pm{Y)P\i\{z,Y)]=5\kmP\i\{x,z), (4.73) 

{|m| = |fc|} 

which shows that for every x 

E[Pm{Y)P\i\{x,Y)] = Pm{x)6\^\\i\. 

That is, 

Pk{x)= Yl CkmPmix) (4.74) 
|m| = |fc| 

with 

Ckm=^[Pk{Y)Pm{Y)]. 

Identity (4.74) implies that Pk{x) is "biorthogonal to itself", i.e. 

and the proof is complete. □ 
Now it is easy to derive orthogonal polynomials with respect to the symmetric measure -Dj^i ^. 
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Proposition 4.5. Let {Pi''^'''^''} he an orthonormal system of polynomials with respect to D^^^j^. 
Then the system {P^'^^'^^'^} defined by 

piMl^^) = 1 ^ (4.75) 



d\ 



where Gd is the group of all permutations of {1, . . . , d}, is orthonormal with respect to ^. 
Proof. In Section 4 we have shown that the polynomial kernel relative to -D|^| ^ is of the form 

Q\n\ (^'y)= Mn2 2^ Q\n\ 'i'^^,ry)= ^ n n (4.76) 

^ cr,T£ga \m\ = \n\ \ o- ' / \ r ' / 

The proof is then completed by Lemma 4.4. □ 
4.2 Kernel Polynomials on the Ewens' sampling distribution. 

The Ewens' sampling distribution can be obtained as a limit distribution from the unordered 
Dirichlet-Multinomial distribution -DM|^| ^ as d — )• oo. The proof of the following proposition 
can be obtained by the same arguments used to prove Proposition 4.1. 

Proposition 4.6. (i) The polynomial kernels with respect to DM} . , are of the same form 



\a\,d 



as (3.42) but with '^|^['"'''^^ replaced by 

Cj[l"l"^)^:=(d!)-^5]e^[l"l"^)(7r(r),.). (4.77) 



TT 



(ii) The Kernel polynomials with respect to ESF\a\ ^i"^ derived by considering the limit form 
^^1"'^ 0/ ^1^1 This has the same form as Cj^i''' (4-'''V> with [x;b][y;b] replaced by 
[r; b]'[s; b]' , where 

[r;6]' = (|a| +|r|)^^l)^ri,(;^)--Ti,(;^) 

and summation is over X]^™' j^j ~ Si"' bj = k, k = 1, . . . \m\. The Kernel polyno- 
mials have the same form as (3.4-2) with replaced ^^j"'^- The first polynomial 
is identically zero under this symmetrization. 

5 Integral representation for Jacobi polynomial kernels. 

This section is a bridge between the first and the second part of the paper. We provide an 
integral representation for Jacobi and Hahn polynomial kernels, extending to d > 2 the well- 
known Jacobi and Hahn product formulae found by Koornwinder and Gasper's for d = 2 ([24], 
GAS72). It will be a key tool to identify, under certain conditions on the parameters, positive- 
definite sequences on the discrete and continuous multi-dimensional simplex. 
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5.1 Product formula for Jacobi polynomials when d = 2. 

For d = 2, consider the shifted Jacobi polynomials normalized by their value at 1: 

3e obtained from the ordinary 

weight measure 



They can also be obtained from the ordinary Jacobi polynomials Pn^ {a,b > —1) with Beta 



Wa,b = {I- x)''{l + xfdx, xe[-l,l] 

via the transformation: 



R^'^ix) = ^" , ^ V ^. (5.79) 



The constant of orthogonality Q is given by (2.34). 
A crucial property of Jacobi polynomials is that, under certain conditions on the parameters, 
products of Jacobi polynomials have an integral representation with respect to a positive (prob- 
ability) measure. The following theorem is part of a more general result of Gasper [10] . 

Theorem 5.1. (Gasper (1972)) A necessary and sufficient condition for the equality 



Pn' (1) Pn (1) J-l Pn (1) 

to hold for a positive measure dfhx^y, is that a > b > —1 and either b > 1/2 or a + b > 0. If 
a + b > —1 or if a > —1/2 and a + b = —1 with x 7^ —y, then rhx^y-a,b is absolutely continuous 
with respect to Wa,b with density of the form 

dm^,y-a^b , Pn^jx) Pn\y) Pn\z) 

with (Pn = Pn'^lf/^ [Pn'^X] . 

An explicit formula for the density (5.81) is possible when a >b > —1/2: 

p:i^\x) p:i'\y) rp^'\ip) 



pf:'\i) p;i'\i) Jo Jo pr(i) 

where 



Jo Jo Pn ' (1) 



^p{x, y; n, w) = {(1 + x)(l + y) + (1 - x){l - y)}/2 + u cosa;V(l - 2;2)(1 - y2) _ 1 

and 

mab{du,du;) = ^ 2r(a + 1) _ ^2y~b~i^2b+i^^^^^^2b^^^^_ ^5 g3^ 

V7rr(a - 6)r(6 + ^) 

See [24] for an analytic proof of this formula. Note that (f){l,l;u,Lo) = 1, so dfha^biu,u}) is a 
probability measure. 

Gasper's theorem can be rewritten in an obvious way in terms of the shifted Jacobi polynomials 
H^'^ix) on [0,1]: 
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Corollary 5.2. For a,/3 > the product formula 



= / ' K'^(z)m,,j,; aAdz) (5.84) 







holds for a positive measure rux^y- a,i3 if <ind only if (3 > a and either a > 1/2 or a + {3 >2. In 

this case m^^y^^ = w.2x-i,2j/-i;/3-i,a-i where dm is defined by (5.81). The measure is absolutely 
continuous if a + f3 > 2 or if 13 > 1/2 and q + /3 > 1 with x ^ y. In this case 

m^^f{dz) = K{x,y,z)D^,p{dz) 

where 

oo 

K{x, y,z) = Y, C^K^{x)K^{y)K^{z) > 0. (5.85) 

n=0 

Remark 5.3. When a, (3 satisfy the constraints of Corollary 5.2, we will say that a,f3 satisfy 
Gasper's conditions. 

When a > 1/2, an explicit integral identity follows from (5.82)-(5.83). Let mapidu^du) = 
fh/s-i^a-i{du,duj). Then 

K''^(x)K'^(2/) = /' r R^'^{ip)m^p{du,du:), (5.86) 
Jo Jo 

where for x,y £ [0, 1] 



(p{x, y;u,uj) = xy + {1 — x){l — y) + 2u cosa;Y^x(l — x)y{l — y). (5.87) 
In (j) set X 2x — 1, y 2y — 1 to obtain (5.87). 

5.2 Integral representation for d > 2. 

An extension of the product formula (5.84) is possible for the kernel for the bivariate Dirichlet 
of any dimension d. 

Proposition 5.4. Let a G M'^ such that, for every j = 1, . . . ,d, aj < "^{Zi 1/2 < aj, 

Si=i — ^- Then, for every x,y £ A((^_i) and every integer \n\, 

Qf„|(x,2/) =E [g;;:|'l"l-"'*(Z,,l) I x,y] (5.88) 

where, for every x,y £ A(^„i), is the [0, 1] random variable defined by the recursion: 

Zi = 1; Zj = <^jDjZj^i j = 2,...,d (5.89) 

with 



{i-x;){i-Y*y i-x,{i-Vz,_,y l-y,{l-VZ,.„ 

(5.90) 

where is a random variable in [0, 1], with distribution 

dm , , v-j-i 
where dm^^y- a,(i is defined as in Corollary 5.2. 
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The Proposition makes it natural to order the parameters of the Dirichlet in a decreasing way, 
so that it is sufficient to assume that a(i) + a(2) > 2 to obtain the representation (5.88). 
Since the matrix A = {anm} is invertible, the proof of Proposition 5.4 only depends on the 
properties of the function The following lemma is in fact all we need. 



Lemma 5.5. For every \m\ E N, (i = 2, 3, . . . and a G M.'^ satysfying the assumptions of Propo- 
sition 5.4, 



("dj(|m|) 



,|m| 



x,y 



(5.91) 



where is defined as in Proposition 5.4- 



Let 6 = a + f3. Assume the Lemma is true. From (5.86) and (5.93) we know that, for every 
n = 0, 1, . . . and every s G [0, 1], 



e (^)(|m|) J^l 



\m\<\n\ 



a 



(H) 



Thus from (5.91) 



m\<\n\ 



x,y 



which is what is claimed in Proposition 5.4. 
Now we proceed with the proof of the Lemma. 

Proof. The proof is by induction. 
If d = 2, x,y G [0,1], 

m| 



Setting y = 1, the only positive addend in (5.92) is the one with j = \m\ so 

ef°f(x,l) = ^^±^.H. 
Therefore, if = a + /?, from (5.86) and (5.93) we conclude 



(5.92) 



(5.93) 



= [ z\^Wy; cAdz)- (5.94) 



Thus the proposition is true for d = 2. 
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To prove the result for any general d > 2, consider 

^ ,_n \^d/ 



(H)i 



m\ 



md=0 



(«d)(md)(l"l - 0!d)(\m\-ma) 



mGN'^^l : |m| = |m.| —m^ 



(5.95) 



where = j^^-, = (i = 1, . . . , (i - 1). 



Now assume the proposition is true for d—1. Then the inner sum of (5.95) has a representation 
like (5.91) and we can write 



ma=0 
{\a\ - Oid)Qjn\-ma) 



(ad)(md)(l«l ~ 



E 



'r^\m\-ma ~ 



(ad-l)(|m|-md) 

where the distribution of given x, y, depends only on a = (ai, . . . , a^-i). Now, set 



(5.96) 



Xd 



_ yd 
(i-yd)v^' 



and define the random variable 



Dd :-- 



(1 - Xd)il - Vd) 



Then simple algebra leads to rewriting equation (5.96) as: 



(5.97) 



iU^x,y)=W. 



(5.98) 

Now the sum in (5.98) is of the form (5.92), with a = ct^-i, /3 = a^, with m replaced by m — rrid 
and the pair (x,y) replaced by (x^,y^). Therefore we can use the equality (5.94) to obtain 



E 



(l«l)(|m|) 



(H) 



(d,z,_i)IHe($h 



(l"l)(lm|) 



E 



y\m\ 



x,y 



i'^d){\m\) 

(the inner conditional expectation being a function of Zd~i) so the proof is complete. 



(5.99) 
□ 
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5.3 Connection with a multivariate product formula by Koornwinder and 
Schwartz. 

For the individual, multivariate Jacobi polynomials orthogonal with respect to Da : a G M*^, a 
product formula is proved in [27]. For every x € A(^„i), a € and n = (ni, . . . , n^^i) : |n| = 
n, these polynomials can be written as 



d-l 

= n 



(5.100) 



where Ej = \a\ — Yli=i ^^'^ ~ ''^ ~ Yli=i The normalization is such that i?°(e^) = 1, 
where := (0, 0, . . . , 1) G W^. For an account of such polynomials see also [17]. 

Theorem 5.6. (Koornwinder and Schwartz) Let a G R"^ satisfy ad > 1/2 and, for every 
j = I, . . . ,d, Oj > Yli=j+i '^i- Then, for every x,y £ '^{d-i) there exists a positive probability 
measure dm* y.a such that, for every n G N!^, 

K{x)K{y) = f K{z)mly.a{dz). (5.101) 

Note that Theorem 5.6 holds for conditions on a which are stronger that our Proposition 5.4. 
This is the price to pay for the measure w.* ^.^ of Koornwinder and Schwartz to have an explicit 
description ( we omit it here), extending (5.83). It is possible to establish a relation between 
the measure 'fn*^ y.a{z) of Theorem 5.6 and the distribution of of Proposition 5.4. 

Proposition 5.7. Let a obey the conditions of Theorem 5.6. Denote with rux^y-a the probability 
distribution of of Proposition 5.4 and m* y.a the mixing measure in Theorem 5.6. Then 



Proof. Notice that both m* y.^ and mx,y-a are absolutely continuous with respect to Da^^\a\-ar 
snsities wi 

Proposition 5.4, 



Denote their densities with ijC-yniz) ■= ^rf"^""'"'" — (z) and fixva(z) = ^n™""''^'" — (z). From 



Now, by uniqueness, 

\m\ = \n\ 

= E CXi^)R?niy) 

\m\ = \n\ 



where (n ■= HK)''^- 

So, by Theorem 5.6 and because Rn{ed) 



(5.102) 



\\m\ = \n\ J 

= j Qf^\{z,ed) dmly.a{z), (5.103) 
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where are orthonormal polynomials. But we know that 

'-d,\a\~aa pad 



SO 



(5.104) 

Thus both fJ-x,y;a{z) and fJ-^^y-ai^) have the same Riesz-Fourier expansion 

oo 
|n|=0 

and this completes the proof. □ 



6 Integral representations for Hahn polynomial kernels. 



Intuitively it is easy now to guess that a discrete integral representation for Hahn polynomial 
kernels, similar to that shown by Proposition 5.4 for Jacobi kernels, should hold for any d > 2. 
We can indeed use Proposition 5.4 to derive such a representation. We need to reconsider 
formula (3.41) for Hahn polynomial in the following version: 



K{m;\r\) := J R^{x)Do,+r{dx) 
with the new coefficient of orthogonality 



<(m; |r|) 



1 



E 



n,\r\ 

Formula (6.105) is equivalent to 

(|a| + kl)(|n|) 



/i°(M;|r|) 



{H + \r\)i\n\)Q 



(6.105) 



(6.106) 



K(x) 



m\ = \r\ 



m 



X 



a G M"^, X G A(rf„i) (6.107) 



(see [17], 5.2.1 for a proof). 



Proposition 6.1. For a G M'^ satisfying the same conditions as in Proposition 5.4, a represen- 
tation for the Hahn polynomial kernels is: 



n,\r\' 



(|a| + \r\){\n\) 



r,s 



ad,l"|-"d 
n\ 



{KM 



\n\ < \r\ = Isl, a G 



(6.108) 



where the expectation is taken with respect to the measure: 



■,sAk)--=[ [ e[('^')z,^(i-z,)H-'= 



x,y 



D^+r{dx)Do,+s{dy), (6.109) 



where Z^i, for every x,y, is the random variable defined recursively as in Proposition 5.4- 
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Proof. From (3.41), 

+ kl)(|n|) 



Then (5.88) implies 

^r„|(r,s) = ^^ n — / / CT'"'""'(^d)"^x,y;a(d^d)I?a+r(dx)I)«+,(dy) 



SO by (6.107) 

X E IH) f f t - ^d)l''|-'m,,,;a(d^d)^a+.(dx)Z)„+,(dy), 

|fe|<|r| -'A(d_i) JA(d_i) Jo J 

and the proof is complete. □ 



7 Positive-definite sequences and polynomial kernels. 

We can now turn our attention to the problem of identifying and possibly characterizing positive- 
definite sequences with respect to the Dirichlet or Dirichlet-Multinomial probability distribution. 
We will agree with the following definition which restricts the attention to multivariate positive- 
definite sequences {pn : n G Z^} which depend on n only via |n|. 

Definition 7.1. For every d> 2 and a G M^^, call a sequence {p|n|}j^|=o a-Jacobi positive- 
definite sequence (a-JPDS) if po = 1 and, for every x,y £ A(-^_;^), 

oo 

p{x,y) = p|„|Qf„i(x,y) > 0. (7.110) 

|n|=0 

For every d > 2, a £ M.'^ and \r\ € Z+, call a sequence {/0|n|}|^|=o '^^ a-Hahn positive- definite 
sequence (a-HPDS) if po = 1 and, for every r,s £ |r|A(^_i), 

oo 

p^(r, s)=Y^ P\n\H\n\ (r, s) > 0. (7.111) 

ln|=0 

7.1 Jacobi Positivity from the integral representation. 

A consequence of the product formulae (5.84) and (5.86) is a characterization of positive-definite 
sequences for the Beta distribution. 

The following is a [0, l]-version of a theorem proved by Gasper with respect to Beta measures 
on [-1,1]. 

Theorem 7.2. (Bochner [4]), Gasper [10]). Let D^^/j be the Beta distribution on [0, 1] with 
Oi < (3. If either 1/2 < a or a + (3 > 2, then a sequence pn is positive-definite for D^^p if and 
only if 

Pn= [ K'^{z)v^,p{z) (7.112) 
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for a positive measure v with support on [0, 1]. Moreover, if 

oo 

u{x) = Y,C^PnRn{x)>0 
n=0 

with 



then 



for every Borel set A C [0, 1]. 



^Q^\Pn\ < OO, 
n=0 

iyiA)= [ u{x)D^^p{dx) (7.113) 



We refer to [4], [10] for the technicalities of the proof. To emphasize the key role played by 
(5.84), just observe that the positivity of v and (7.112) entails the representation 

oo „i 
p{x,y) ■.= Y,QnPnK'^{x)K'^{y) = / n(z)m,,y;„,/3(dz) > 0, 

and u{z) = p{z, 1), whenever u{l) is absolutely convergent. 

To see the full extent of the characterization, we recall, in a Lemma, an important property 
of Jacobi polynomials, namely: two different systems of Jacobi polynomials are connected by 
an integral formula if their parameters share the same total sum. 

Lemma 7.3. For f^i > 0, 

' K'^(l - (1 - x)z) D^Jdz) = K-^'/^+^(x) (7.114) 



and ^ 

C R'^Jixz) D^,,{dz) = ^"^1, ' k^^''-^{x). (7.115) 

Proof. We provide here a probabilistic proof in terms of polynomial kernels Q'^^ {x,y), even 
though the two integrals can also be view as a reformulation, in terms of the shifted polynomials 
FCi'^ , of known integral representations for the Jacobi polynomials {Pn'*} on [—1, 1] (a, 6 > — 1) 
(see e.g. AA 7.392.3 and 7.392.4). 

Let us start with (7.115). The moments of a Beta (a,/3) distribution on [0, 1] are, for every 
integer m < n = 0, 1, . . . 

^(rn)l^(n—rn) 



(« + /3)(„) 



Now, for every n S N, 



C C'^K'^{xz)Do,,^{dz) = /'Q°''3(xz,l)Z),,^(dz) 

JQ Jo 

= > ^ anm . . / {xz)'^Do,,^{dz) 
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and this proves (7.115). 

To prove (7.114), simply remember (see e.g. [17], 3.1) that 

P(n) 

and that 

i?-./3(x) = ""^^ 
i?^'"(0) 

So we can use (7.115) to see that 

Jo Bn' (0) P{n) Cn' 

and the proof is complete. □ 
Lemma 7.3 completes Theorem 7.2: 

Corollary 7.4. Let a < {3 with a + f3 > 2. If a sequence pn is positive- definite for D^^p, then 
it is positive- definite for D^^^^p-^, for any < /i < /3. 

Proof. By Theorem 7.2 pn is positive-definite for -D^,/? if and only if 

5^CVK'^(^)>o. 

n 

So (7.115) implies also 

n Cn 



The case for -Da_^,/3+^ is proved similarly, but using (7.114) instead of (7.115). 



□ 



For d > 2 Proposition 5.4 leads to a similar characterization of all positive-definite sequences, 
for the Dirichlet distribution, which are indexed only by their total degree, i.e. all sequences 

Pn = P\n\- 

Proposition 7.5. Let a G satisfy the same conditions as in Proposition 5.4- A sequence 

{Pn = P\n\ ■ n £ 'N} is positive- definite for the Dirichlet (a) distribution if and only if it is 
positive- definite for Di.\a\,(i-c)\a\j for every c £ (0, 1). 

Proof. Sufficiency. First notice that, since 

Q-'^(x,y) = Q^'"(l-x,l-y), (7.118) 

then a sequence is positive-definite for -Do,/? if and only if it is positive definite for so that 

we can assume, without loss of generality, that c\a\ < (1 — c)\a\. Let a = (ai > 02 > • • • > ad) 
satisfy the conditions of Proposition 5.4 (again, the decreasing order is assumed for simplicity) 
and let 

00 

|n|=0 
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If ad > c^a\ then Corollary 7.112, applied with = — C|q| implies that 

'|r 



|n|=0 

SO by Proposition 5.4 



< 



|n|=0 



|n|=0 

If < C|q|, then apply Corollary 7.112 with fi = |a|(l — c) — to obtain 



(7.119) 



|n|=0 |n|=0 

which implies again (7.119), thus {/0|„|} is positive-definite for Da. 
Necessity. For / C { 1 , . . . , , the random variables 

^/ = E^^' ^/ = E^^- 

have a Beta(a/, |q;| — a/) distribution, where a/ = J2jei'^3- Since 

ii;(g^(x,y)|Y; = z) = Q-^(z), 

then for arbitrary x,y & A(„_i), 

oo 

E P\n\Qlnl{x,y)>0 
\n\=0 

implies 

q;;-|"'-"^(x, y) = q|:|-"^'"^ (i - x, i - y) > o. 

Now we can apply once again Corollary 7.112 with fi = ib(c|a| — aj) (whichever is positive) to 
obtain, with the possible help of (7.118), 

oo 

E^h^h''^'"'^'"'(^'^)^o «,^g[0,1]. 

|n|=0 

□ 

7.2 Positive definite sequences in the Dirichlet-Multinomial distribution. 

In this Section we aim to investigate the relationship existing between JPDS and HPDS . In 
particular, we wish to understand when (P2) is true, i.e. when a sequence is both HPDS and 
JPDS for a given a. It turns out that, by using the results in Sections 3 and 6, it is possible 
to define several (sometimes striking) mappings from JPDS and HPDS and viceversa, but we 
could prove (P2) only for particular subclasses of positive-definite sequences. In Proposition 7.8 
we prove that every JPDS is a limit of (P2)-sequences. Later in Proposition 8.6 we will identify 
another (P2)-family of positive-definite sequences, as a proper subfamily of the JPDSs, derived 
in Section 8 as the image, under a specific bijection, of a probability on Z+. 

The first proposition holds with no constraints on a or d. 
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Proposition 7.6. For every d and a S W^, let p = {p\n\} be a a-JPDS. Then 

Pl-lTTTTlk ' n = 0,1,2,... 

(I«l +^)(M) 

is a positive- definite sequence for DMa{-; N) for every N = 1, 2, . . .. 
Proof. From Proposition 3.1, if 

oo 

P\n\Qfn\ix,y) > 

|n|=0 

then for every r, s G N'^ : |r| = |s|, 

y" P\n\ I I QZAx,y)De,+r{dx)D^+,{dy) = V P\n\jrYT^\ ^H^' 

ln|=o M=o ^l«l + l^l)(H) 



Two important HPDSs are given in the fohowing Lemma. 
Lemma 7.7. For every d, every \m\ < N and every a G W^, both sequences 

f \m\i\n\] (|a|+^)(|n|)l 



(l"l + iV[|' 



\n\&Z+ 



and 



|n|gZ+ 



are a-HPDSs for DM^i-; N). 

Proof. From Proposition 3.5, by inverting (3.58) we know that, for \m\ = 0, . . . , N 



so 

I \m 



\l\n\] (l«l+^)(|n.|) 
\(|a| + kl)(|n|) l^l[|n|] 

is a HPDS. 

Now let be a JPDS. By proposition 7.6, the sequence 



(H+^)(H) 



is a-HPDS. By multiphcation. 



N[in\] IH[H] (l"l+^)(H) _,^ 



(|a| +^)(|n|) (l«l + b"l)(|n|) ^[\n\] J 1 (|a| + |m| 



:\n\) 
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is HPDS as well. This also implies that 



(|a| + |m|) 



(hi) 



is HPDS (to convince oneself, take p as in Example 8.3 or in Example 8.4 and take the limit as 
t — )• or z — 7- 1 respectively). □ 

We are now ready for our first result on (P2)-sequences. 

Proposition 7.8. For every d and a G M^j., let p = {p|„|} be a a-JPDS. Then there exist a 
sequence {pj^i : |n| G Z+}|^^q such that, 

(i) for every |n|, 



(ii) for every N, the sequence {P|^|} is both HPDS and JPDS. 

Proof. We show the proof for d = 2. For d > 2 the proof is essentially the same, with all 
distributions obviously replaced by their multivariate versions. Take I, J two independent 
DMa^l3{-; N) and DMa^p[-;M) random variables. As a result of de Finetti's Representation 
Theorem, conditionally on the event {lim7v->oo(;^;^) = the (/, J) are independent 

Binomial r.v.'s with parameter {N,x) and {M,y), respectively. 
Let / : [0, 1]^ — )• M be a positive continuous function. The function 



/(- - 



x,y 



BN,Mf{x,y) := E 

is positive as well and as N, M ^ oo, 

BN,Mf{x,y) — >f{x,y). 



N,M = 0,1,... 



Now take 



|n| 



for every x,y £ [0, 1]. Then, for X, Y independent Da^/s, 



P\n\ 



E 

E 



Q^^{X,Y)p,{X,Y) 



Q";f(X,y) lim BN,NPpiX,Y) 



where 



P\n\ ■'- 



lim E 

N^oo 
lim 

' 



Q^^{X,Y)Bn,nPp{X,Y) 



N^oo 



E 



Q'^;nX,Y)BN,NPp{X,Y) 
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But Bn^nVp is positive, so (i) is proved. 
Now rewrite 



P\n\ 



I „l N N 



JO 



i=l j=l 



N N 



J2 E N)DM^{j; N)pp 



i=i j=i 



E 



Qf'f(x,y) I i,j 



(a + /3 + iV)(l„l) 



-E 



(7.123) 



for /, J are independent DMa^p{-]N) random variables. The last equality follows from (3.41). 
Since pp is positive, from (7.123) it follows that 

f (a + /3 + iV)(|„|) -| 

is, for every A^, a-HPDS. But by Lemma 7.7, we can multiply every term of the sequence by 
the HPDS (7.122) where we set \m\ = N, to obtain (ii). 

□ 

The next Proposition shows some mappings from Hahn to Jacobi PDSs. It is in some sense a 
converse of Proposition 7.6 under the usual (extended) Gasper constraints on a. 

Proposition 7.9. If a satisfies the conditions of Proposition 5.4, let {p\n\} be a-HPDS for 
some integer N. Then both {p\n\} o,nd (7.120) are positive- definite for Da- 
Proof If 

oo 

5^PHi^|°,(r,s)>0 

|n|=0 

for every r, s G A^A(^„i), then Proposition 3.2 implies that 



\n\=0 



Now consider the Hahn polynomials re-normalized so that 

^1 q;-i°i-°^(x,i) 



h' 



ai .|q:|— ai 
n\ 



(r-N) 



_J 

g;-i"i-"^(i,i) 



Da+r{dx) 



Then it is easy to prove that 
and 



E 



■/^P|l"l^"^(i?;iV) 



h 



Nu 



|n| =0,1,, 
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(see also [17], (5.65).) Hence 



oo 
|n|=0 



P\n\ ^^^7^ C\n\ ^|n| (^l?^) =■ f^ir)- 

So for every \n\, 



\n\=0 



P\n\ 



K 







/^(i?)/iPl'l"'-"^(/?;iV) 
(t>N{x)R\^\{x)D^{dx) (7.124) 



where 



r=0 ^ ^ ^ 

hence, by Gasper's theorem (Theorem 7.2) /9|„| is (ai, |a| — ai)-JPDS. Therefore, by Proposition 
7.5, it is also a-JPDS. Finally, from the form of ^|^| we know that 

kl[|n|]/(|a| + kl)(|„|) = ^|^|(n) 
is a-JPDS, thus (7.120) is JPDS. 

□ 

Remark 7.10. Notice that 



+ kl){H) 

is itself a positive- definite sequence for Da- This is easy to see directly from the representation 
(2.23) o/^i^i (we will consider more of it in Section 8). 

Since products of positive- definite sequences are positive definite sequences, then we have, as a 
completion to all previous results. 

If {/0|n|} is positive-definite for D^, then (7.120) positive-definite for both Da and DMa- 



8 A probabilistic derivation of Jacobi positive-definite sequences. 

In the previous sections we have found characterizations of Dirichlet positive-definite sequences 
holding only if the parameters satisfied a particular set of constraints. Here we show some 
sufficient conditions for a sequence to be a-JPDS, not requiring any constraints on a. This is 
done by exploiting the probabilistic interpretation of the Orthogonal Polynomial kernels. Let 
us reconsider the function From (2.21) we can write, for every \m\ E N, and x,y £ A^_i, 

C\m\{x,y)Da{dy) = O^'l^^x'Da+iidy) (8.125) 

|«| = |m| ^ ^ 
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This is, for every |m|, a transition kernel, expressed as a mixture of posterior Dirichlet distri- 
butions, with multinomial mixing measure. Similarly, because of the symmetry of for ^^ly 
fixed |m|, the bivariate measure 

B1^a,\m\{dx,dy) := C\m]{x,y)Da{dx)Da{dy) (8.126) 

so ^1^1(2^5?/) has the interpretation as a (exchangeable) copula for the joint law of two vectors 
{X,Y), with identical Dirichlet marginal distribution. Such a joint law can be simulated via 
the following Gibbs sampling scheme: 

(i) Generate a vector X of Dirichlet{a) random frequencies on d points. 

(a) Conditional on the observed X = x, sample \m\ iid observations with common law x. 

(Hi) Given the vector I of counts obtained at step (ii), take Y as stochastically independent of 

X and with distribution Da+i{dy). 

The bivariate measure B'Da,\m\ its infinite-dimensional extension has found several applica- 
tions in Bayesian Statistics e.g. by [31], but no connections were made with orthogonal kernel 
and canonical correlation sequences. A recent important development of this direction is in [33]. 

Now, let us allow the number |m| in the above procedure to be random, i.e. for a probability 
distribution : = 0, 1, 2, . . .} on N we modify step (ii) to 

(a)' Conditional on the observed X = x, with probability sample \m\ iid observations with 
common law x. 

Then the above Gibbs-sampling procedure leads us to a new exchangeable joint distribution, 
with identical Dirichlet marginals and copula given by 

00 

BVo,,d{dx,dy) = E [BV^^\M\{dx,dy)\ = ^ d\,^\i'^^^{x,y)D^{dx)D^{dy). (8.127) 

|m|=0 

The probabilistic construction has just led us to prove the following 

Proposition 8.1. Let {d\m\ : /n, = 0, 1, . . .} be a probability measure on {0,1,2,...}. Suppose 
that, for every \9\ > 0, the series 

P\n\= E (igl + ltn ^l-l' H =0,1,2,... (8.128) 

converges. Then {p|„|} is a-JPDS for every d and every a G M'^ such that |a| = \9\. 
Proof. Note that 

00 

|m|=0 
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is always true for every probability measure 

Now reconsider the form (2.23) for the (positive) function (^^^ : we can rewrite (8.127) as 



< Yl dimiC\L\ix,y) 

\m\=0 
oo 



l\n\] 



_|_ ) "hi 
|m|=0 |n|<|m| ' ^ ^ ^^(H) 



E 

|n|=0 



\ ^ i""'iLI"IJ J 



m|>| 



\n\=0 



i.l29) 



□ 



and since (8.129) does not depend on the dimension of a, then the proposition is proved for 

Da. 

Example 8.2. Take = the probability assigning full mass to \l\. Then 

Ulrinii 



P\n\ = Y 
m|>|n 



m 



+ \m\ 



l)(H)"""" (l^l + KI)(H) 



> n 



.130) 



and by Proposition 2.1, 



Y p\n\QW^y) = Y m l Z Q\n\i^^y) = m^^y)- (8-i3i) 



|n|=0 |n|=0 ' ' '""^d"!) 

Example 8.3. Consider, for every t >0, the probability distribution 



\n\>\n\ 



.132) 



where (flj^||„|) is the invertible triangular system (2.20) defining the polynomial kernels Qj^j in 
Proposition 2.1. Since the coefficients of the inverse system are exactly of the form 



\m\ 



[\nW 



(l^l + IH)(H) 



, |m|, |n| = 0, 1, 2, . . . 



then 



P\n\{i) 



e 2 



\\n\{\n\ + \a\-l)t 



is for every t a positive- definite sequence. In particular, it is the one characterizing the neutral 
Wright-Fisher diffusion in Population Genetics, mentioned in section 1.1., whose generator has 
eigenvalues —\\n\{\n\ + \a\ — 1) and orthogonal polynomial eigenf unctions. 
The distribution (8.132) is the so-called coalescent lineage distribution (see [15], [16]), i.e. the 
probability distribution of the number of lineages surviving up to time t back in the past, when 
the total mutation rate is \a\ and the allele frequencies of d phenotypes in the whole population 
are governed by A\a\,d- 
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Example 8.4 (Perfect independence and dependence). Extreme cases of perfect dependence 
or perfect independence can be obtained from Example 8.3 when we take the limit as t ^ or 
t — oo, respectively. In the former case, (i|^|(0) = 5|m|oo that p\n\{0) = 1 for every \n\. The 
corresponding bivariate distribution is such that 

Eo{Qn{Y)\X = x) = Qn{x) 

so that, for every square-integrable function 

f ~ ^ ^ CnQn 

n 

we have 

Eo{f{Y)\X = X) = Y,CnQn{x) = f{x) 

n 

that is, |o} is in fact the Dirac measure 5{y — x). 

In the latter case, (i|m|(oo) = that p|„|(oo) = for every \n\ > 1 and 'EQ{Qn{Y)\X = 

x) = E implying that 

E^{f{Y)\X = x)=E[f{Y)] 
i.e. X,Y are stochastically independent. 

8.1 The infinite-dimensional case. 

Proposition 8.1 also extends to Poisson-Dirichlet measures. The argument and construction 

;P I with ^f'^'' 



are the same, once one replaces i with Ct}!!\'°°- We only need to observe that because the 



functions 

forming the terms in ^1^1^''°° (s^e (4.66)), are probability measures on |m|A;^, then the kernel 

defines, for every x, a proper transition probability function on A;^, allowing for the Gibbs- 
sampling interpretation as in Section 8, but modified as follow: 

(i) Generate a point X in with distribution PD(^\6\'^ . 

(a) Conditional on the observed X = x, sample a partition of \m\ with distribution function 

(Hi) Conditionally on the parition I obtained at step (ii), take Y as stochastically independent 
of X and with distribution 

(H)tt(0[x,/]PZje(dj/) 
ESF\e\{l) 

Thus the proof of the following statement is now obvious. 

Proposition 8.5. Let {d\m\ : m = 0, 1, . . .} be a probability measure on {0, 1, 2, . . .}. Suppose 
that, for every \6\ > 0, the series 

P\n\= E n^l^l'S! ^IH' |n| =0,1,2,... (8.133) 

\m\>\n\ ^l^l + l"^l^(M) 

converges. Then {p\n\} is a positive- definite sequence for the Poisson-Dirichlet point process 
with parameter \6\ 
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8.2 From Jacobi to Hahn positive-definite sequences via discrete distribu- 
tions. 

We have seen in proposition 7.9 that Jacobi positive-definite sequences {p|„|} can always be 
mapped to Hahn positive-definite sequences of the form {p\n\ Qa\+N) — '^^^ show that a 
JPDS {p|„|} is also HPDS when it is the image, via (8.128), of a particular class of discrete 
probability measures. 



form 



Proposition 8.6. For every N and \9\ > 0, let p^^^ = : \n\ G Z+j be of the same fo 

(8.128) for a probability mass function d^^^ = {d^rn\ ■ ^ ^+} such that d\l\ = for every 
\l\ > N. Then p^^^ is a-JPDS if and only if it is a-HPDS for every d and a G such that 

\a\ = \e\. 



Proof. By Lemma 7.7, the sequence 



(l«l +"^)(|n|) 



is HPDS (to convince oneself, take p as in Example 8.3 or in Example 8.4 and take the limit as 
t — )• or z — 7- 1 respectively). 

Now replace \m\ with a random M with distribution given by d^^\ Then 



< E 



E 

|n|=0 



{\a\+M) 



-m 



i\n\) 



N 



N 



EE" 

|n|=0 \|m| = |n| 



(JV) 



IMI 



I*"! (|a| +M) 



Tja 



which proves the "Hahn" part of the claim. The "Jacobi" part is obviously proved by Proposition 
8.128. □ 



9 From positive-definite sequences to probability measures. 

In the previous Section we have seen that it is possible to map probability distributions on Z+ 
to Jacobi positive-definite sequences. It is natural to ask if, on the other way around, JPDS's 
{/0|„|} can be mapped to probability distributions {(i[m|} on Z+, for every m = 0, 1, . . . , via the 
inversion 

oo 

d\m\{p) = ""MmfH ^^-^^^^ 

n=m 

In this Section we give some sufficient conditions on p for (i|m|(/o) to be nonnegative for every 
|m| = 0, 1, . . ., and an important counterexample showing that not all JPDS can be associated 
to probabilities. We restrict our attention to the Beta case {d = 2) as we now know that, if 
associated to a probability on Z+, any JPDS for d = 2 is also JPDS for d > 2. 
Suppose p = {p\n\}^\=o satisfies 

oo 

Pp{x,y) ■■= Y p\n\Q"n\i^^y) ^ (9.135) 

|n|=0 
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and in particular 



Pp{x) ■.= pp{x,l) > 0. 



(9.136) 



Proposition 9.1. // all the derivatives of Pp{x) exist, then d^m\{p) ^ for every \m\ G if 
and only if all derivatives of Pp{x) are non-negative. 

Proof. Rewrite d|m|(/o) as 

oo 

d\m\{p) = ^ 0'\v\ + \m.\,\m\PH + \"^\ 

oo 

E|6»|+2|m| 
"11)10 ^I'l-l + lr 



\v\=0 



+ I"T-I)(|m.|) \^ ^|6»|+2|m|_ 

\m\\ 



|m| =0,1,.... (9.137) 



\v\=0 



This follows from the general identity: 



\u\ 



+ + ljl)(ljl)- 



"'bl+lil>l+b1 ~ "I^M«I {\u\ + \j\y. 
Now consider the expansion of Jacobi polynomials. We know that 

n 

|m|=0 



(9.138) 



(9.139) 



where 



i=0 



m\ 



(xyYlil - x){l - y)]\ 



«(i)/3(|m|-r) 
l^l{H) 



Since -R|^|^(l) = 1 and Cpij*(0, 1) = S\m]o then 



Therefore (9.137) becomes 



|d|=0 



Now apply e.g. [[18], (4.3.2)] to deduce 
For |m| = 1, 



"(|m|) 



(9.140) 

0,1,... 

(9.141) 

(9.142) 



P\v\+1 



Pp{x)R'^f^^{x)Da,i3{x)dx 



a 



(2) Jo 



Pp{x) 



jy 

dx I'' 



x)Da+i,/3+i(x) 



1^1(2) Jo \dx 



Ppi^) ] RZt'^''^^'^ ix)Da+i,p+iix)dx. 
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The last equality is obtained after integrating by parts. Similarly, denoted 



\m\ = 0, 1, . . . 



it is easy to prove that 



P\v\ + \m.\ 

So we can write 



(2|m|) Jo 



W\{\m\) 



(9.143) 



Thus if Pp > 0, then is, for every \m\, non-negative and this proves the sufficiency. 

For the necessity, assume, without loss of generality, that {d^rn\{p) '■ ^ S Z+j is a probability 
mass function on Then its probability generating function {pgf) must have all derivatives 
nonnegative. For every < 7 < the pgf has the representation: 

00 



\m\=0 



'7,|e|-7 



m|=0 



'7,|e|-7 



|m|=0 



1^7,191-7 [Pp{sZ)] 



(9.144) 



where Z is a Beta(7, \9\ —7) random variable. Here the second equality follows from the identity: 

(9.145) 



^xH=er4(x,l), a,/3>0 
«(l-l) ' ' 



and the third equality comes from (9.139). 
So, for every /c = 0, 1, . . ., 



< 



ds 



^^(.)=E,,|e|_, [zlSf ^(s^) 



(9.146) 



for every 7 G (0, \9\). Now if we take the limit as 7 — t- Z — t- 1 so, by continuity. 



E, 



7,|0|-7 



■y-,\0\ 



preserving the positivity, which completes the proof. 



□ 



9.1 A counterexample. 

In Gasper's representation (Theorem 7.2), every positive-definite sequence is a mixture of Jacobi 
polynomials, normalized with respect to their value at 1. It is natural to ask whether these 
extreme points lead themselves to probability measures on Z_|.. A positive answer would imply 
that all positive-definite sequences, under Gasper's conditions, are coupled with probabilities 
on the integers. Rather surprisingly, the answer is negative. 
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Proposition 9.2. Let a,/3 > satisfy Gasper's conditions. The function 



d\m\= Yl «H|m|^Sf(^)' |m| =0,1,2,... 

n|>|r7i| 

is not a probability measure. 
Proof. Rewrite 

oo \n\ 

Ms) = Y.RnH-) E 

n=0 |m|=0 
oo 

= E 5] cr„f i?rni (^)^H (^^)' (9-147) 

|n|=0 

where is a Beta (a, /3) random variable. This also shows that, for every x, 

^ n=0 

i.e. the Dirac measure putting all its unit mass on x (see also Example 8.4). 

Now, if (pxis) is a probability generating function, then, for every positive L2 function g, any 

mixture of the form 

q{s) = / g{x)(l)x(s) — — dx 

Jo ^[o:,P) 

9{ws) ^ 7 dw (9.148) 

must be a probability generating function, i.e. it must have all derivatives positive. However, 
if we choose g{x) = e~^^, then we know that, g being completely monotone, the derivatives of 
q will have alternating sign, which proves the claim. □ 
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